An overview of nitech HMM-based speech synthesis system for blizzard challenge 2005

نویسندگان

  • Heiga Zen
  • Tomoki Toda
چکیده

In the present paper, hidden Markov model (HMM) based speech synthesis system developed in Nagoya Institute of Technology (Nitech-HTS) for a competition of text-to-speech synthesis systems using the same speech databases, named Blizzard Challenge 2005, is described. We show an overview of the basic HMM-based speech synthesis system and then recent developments to the latest one such as STRAIGHT-based vocoding, hidden semi-Markov model (HSMM) based acoustic modeling, and parameter generation considering global variance are illustrated. Constructed voices can synthesize speech around 0.3 xRT (real time ratio) and their footprints are less than 2 MB. The listening test results show that performances of our systems are much better than we expected.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Details of the Nitech HMM-Based Speech Synthesis System for the Blizzard Challenge 2005

In January 2005, an open evaluation of corpus-based textto-speech synthesis systems using common speech datasets, named Blizzard Challenge 2005, was conducted. Nitech group participated to this challenge with a newly designed HMM-based speech synthesis system (Nitech-HTS 2005). In the present paper, technical details, building processes, and the performance of the Nitech-HTS 2005 voices are des...

متن کامل

An Overview of Nitech HMM-based for Blizzard Challen

In the present paper, hidden Markov model (HMM) based speech synthesis system developed in Nagoya Institute of Technology (Nitech-HTS) for a competition of text-to-speech synthesis systems using the same speech databases, named Blizzard Challenge 2005, is described. We show an overview of the basic HMM-based speech synthesis system and then recent developments to the latest one such as STRAIGHT...

متن کامل

The Nitech-NAIST HMM-Based Speech Synthesis System for the Blizzard Challenge 2006

The present paper describes an HMM-based speech synthesis system developed by the Nitech-NAIST group for the Blizzard Challenge 2006 (Nitech-NAIST-HTS 2006). To achieve improvements over the 2005 system (Nitech-HTS 2005), new features such as MGC-LSP, MLLT, and full covariance GV pdf are investigated. Subjective listening test results show that combining mel-cepstral coefficients, MLLT and full...

متن کامل

Overview of NITECH HMM - based text - to - speech system for Blizzard Challenge 2014

This paper describes a hidden Markov model based text-tospeech (TTS) system developed at the Nagoya Institute of Technology (NITECH) for Blizzard Challenge 2014. The tasks of Blizzard Challenge 2014 are speech synthesis of six Indian languages and multilingual speech synthesis, i.e., Indian language and English. Only Indian language speech data and text are provided as training data. We focused...

متن کامل

Overview of NITECH HMM - based speech synthesis system for Blizzard Challenge 2013

This paper describes a hidden Markov model (HMM) based speech synthesis system developed for the Blizzard Challenge 2013. In the Blizzard Challenge 2013, audiobooks are provided as training data. In this paper, we focus on a construction of databases for training acoustic models from audiobooks. An automatic alignment technique based on speech recognition is used for obtaining pairs of audio an...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005